Sheffield | 25-SDC-Nov | Sheida Shabankari | Sprint 2 |Improve code with precomputing by sheida-shab · Pull Request #113 · CodeYourFuture/Module-Complexity

sheida-shab · 2026-02-18T14:05:21Z

I have titled my PR with Region | Cohort | FirstName LastName | Sprint | Assignment Title
My changes meet the requirements of the task
I have tested my changes
My changes follow the style guide

PR summary :

Improved find_longest_common_prefix using two techniques:

Precomputing prefix hashes for each string to avoid repeated character-by-character comparisons.
Sorting the strings so that only adjacent strings need to be compared, which greatly reduces the number of comparisons for large lists.
These changes make the function much faster while keeping all existing tests passing, including the large list test.

Improved count_letters by precomputing lowercase letters for faster lookup. Only uppercase letters without a lowercase version are counted. All tests, including long strings, pass.

…refix

… hashes

cjyuan

Can you use complexity to explain why your implementation is an improvement?

cjyuan · 2026-02-19T21:01:46Z

Sprint-2/improve_with_precomputing/common_prefix/common_prefix.py

+    # Precompute prefix hashes for each string to speed up comparisons
+    prefix_map={s:[hash(s[:i+1]) for i in range(len(s))] for s in strings}


How do these "prefix hashes" speed up comparison?

At first, I thought the exercise needed me to precompute and keep some data for each string. So I used prefix hashes, thinking this would avoid comparing letters one by one and make the function much faster.
But after testing, I saw that using hashes did not make it much faster. Sorting the strings and comparing only neighbors worked better, was simpler, and gave the speed improvement needed.

Note: With the way you used prefix hashes, the complexity actually became higher (when the length of each string is large).

That makes sense. In my case, prefix hashing introduced additional preprocessing and memory costs without improving the asymptotic complexity. Once I removed it and only compared adjacent strings after sorting, the solution became simpler and faster, especially for longer inputs.

sheida-shab · 2026-02-19T23:30:29Z

Hi CJ,Thanks for your feedback.
I improved count_letters by reducing time complexity from O(n²) to O(n).

Previously, for each character in the string (O(n)), the code performed a membership check on the entire string (O(n)), resulting in O(n²) complexity.

By precomputing a set of all lowercase characters once (O(n)) and using O(1) set lookups inside the loop, the overall complexity is reduced to O(n) .

cjyuan · 2026-02-20T02:06:42Z

Can you also do a complexity analysis for find_longest_common_prefix?

sheida-shab · 2026-02-20T18:02:16Z

The time complexity of find_longest_common_prefix now is O(n log n + n · m), where n is the number of strings and m is the average length of a string.
Sorting the strings takes O(n log n), and then comparing only adjacent strings takes O(n · m) in the worst case.

cjyuan · 2026-02-20T19:32:19Z

How does the program determine the order between two strings? Is the performance affected by string length when we compare two strings? If we take into account m, then the sorting complexity is no longer just O(nlogn).

sheida-shab · 2026-02-21T11:01:12Z

Thanks for your feedback!
The ordering between two strings during sorting is determined lexicographically in Python, by comparing characters one by one until a difference is found or one string ends.
Comparing two strings therefore takes O(m) time in the worst case, where m is the string length.

Since sorting performs O(n log n) such comparisons, the actual cost of sorting strings is O(n log n · m), not just O(n log n).
After sorting, comparing adjacent strings takes O(n · m) in the worst case.

Thus, the overall time complexity is:

O(n log n · m)

cjyuan · 2026-02-21T11:58:11Z

Spot on.

sheida-shab added 3 commits February 18, 2026 13:24

Implemented precomputing using prefix hashes in find_longest_common_p…

a312f12

…refix

Optimize find_longest_common_prefix using sort and precomputed prefix…

d57e451

… hashes

Precompute lowercase letters to speed up count_letters

cf45392

sheida-shab added Needs Review Trainee to add when requesting review. PRs without this label will not be reviewed. Module-Complexity The name of the module. labels Feb 18, 2026

cjyuan reviewed Feb 19, 2026

View reviewed changes

cjyuan added Reviewed Volunteer to add when completing a review with trainee action still to take. and removed Needs Review Trainee to add when requesting review. PRs without this label will not be reviewed. labels Feb 19, 2026

sheida-shab added Needs Review Trainee to add when requesting review. PRs without this label will not be reviewed. and removed Reviewed Volunteer to add when completing a review with trainee action still to take. labels Feb 19, 2026

cjyuan added Reviewed Volunteer to add when completing a review with trainee action still to take. and removed Needs Review Trainee to add when requesting review. PRs without this label will not be reviewed. labels Feb 20, 2026

Refactor finding longest common prefix

d1d59ab

sheida-shab added Needs Review Trainee to add when requesting review. PRs without this label will not be reviewed. and removed Reviewed Volunteer to add when completing a review with trainee action still to take. labels Feb 20, 2026

cjyuan added Reviewed Volunteer to add when completing a review with trainee action still to take. and removed Needs Review Trainee to add when requesting review. PRs without this label will not be reviewed. labels Feb 20, 2026

sheida-shab added Needs Review Trainee to add when requesting review. PRs without this label will not be reviewed. and removed Reviewed Volunteer to add when completing a review with trainee action still to take. labels Feb 21, 2026

cjyuan added Complete Volunteer to add when work is complete and all review comments have been addressed. and removed Needs Review Trainee to add when requesting review. PRs without this label will not be reviewed. labels Feb 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

Sheffield | 25-SDC-Nov | Sheida Shabankari | Sprint 2 |Improve code with precomputing#113

Sheffield | 25-SDC-Nov | Sheida Shabankari | Sprint 2 |Improve code with precomputing#113
sheida-shab wants to merge 4 commits intoCodeYourFuture:mainfrom
sheida-shab:Feat-improve-with-precomputing

sheida-shab commented Feb 18, 2026

Uh oh!

cjyuan left a comment

Uh oh!

cjyuan Feb 19, 2026

Uh oh!

sheida-shab Feb 19, 2026

Uh oh!

cjyuan Feb 20, 2026 •

edited

Loading

Uh oh!

sheida-shab Feb 20, 2026

Uh oh!

sheida-shab commented Feb 19, 2026

Uh oh!

cjyuan commented Feb 20, 2026

Uh oh!

sheida-shab commented Feb 20, 2026

Uh oh!

cjyuan commented Feb 20, 2026

Uh oh!

sheida-shab commented Feb 21, 2026

Uh oh!

cjyuan commented Feb 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		# Precompute prefix hashes for each string to speed up comparisons
		prefix_map={s:[hash(s[:i+1]) for i in range(len(s))] for s in strings}

Uh oh!

Comments

Conversation

sheida-shab commented Feb 18, 2026

Uh oh!

cjyuan left a comment

Choose a reason for hiding this comment

Uh oh!

cjyuan Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

sheida-shab Feb 19, 2026

Choose a reason for hiding this comment

Uh oh!

cjyuan Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sheida-shab Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

sheida-shab commented Feb 19, 2026

Uh oh!

cjyuan commented Feb 20, 2026

Uh oh!

sheida-shab commented Feb 20, 2026

Uh oh!

cjyuan commented Feb 20, 2026

Uh oh!

sheida-shab commented Feb 21, 2026

Uh oh!

cjyuan commented Feb 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cjyuan Feb 20, 2026 •

edited

Loading